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Abstract 

The  development  of  conditional  propositions,  deduction 
between  conditionals,  and  boolean-like  operations  on 
conditionals,  and  their  associated  probabilities  are  here 
unified  in  terms  of  boolean  relations  of  the  form  "b  =  0“ 
defined  on  an  initially  relation-free  algebra  of  boolean 
polynomials  that  transcends  an  initial  domain  of  dis¬ 
course.  The  conditional  proposition  (alb)  is  assigned  the 
conditional  probability  P(a\b),  which  is  different  from  the 
probability  that  (alb)  is  a  tautology.  The  resulting  alge¬ 
braic  techniques  are  demonstrated  in  several  examples  such 
as  by  simplifying  a  circular  rule-based  expert  system  and 
removing  its  circularity  and  by  deriving  logical  and  prob¬ 
ability  formulas  for  keeping  communication  lines  open. 

Introduction 

Among  the  various  concepts1  residing  at  the  nexus  of  so 
many  intellectual  subtleties  that  have  come  into  scientific 
consciousness  because  of  our  efforts  to  replicate  human 
information  processing  in  computers  (Artificial  Intelli¬ 
gence),  none  is  more  daunting  than  the  central  notion  of  a 
conditional  proposition  (in  logic),  a  conditional  event  (in 
probability)  -  conditional  information  in  general. 

Conditional  Information 

Actually,  all  information  is  inherently  conditional:  It  is 
quite  impossible  to  propound  a  proposition  without 
assumptions!  These  assumptions  are  the  conditions  or 
context  of  the  information.  However,  some  assumptions 
are  always  implicit  rather  than  explicit.  This  fact  of 
intellectual  life  is  enshrined  by  science  and  philosophy  in 
the  axiomatic  method,  which  ever  seeks  to  prove  conse¬ 
quences  from  assumed  postulates.  The  final  premise  of 
Science  is  material  existence  and  knowledge  thereof;  the 
final  premise  of  religion  is  God  and  the  knowledge  there¬ 
of.  The  premise  of  cosmology  is  the  whole  of  material 
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reality  and  the  knowledge  thereof;  the  premise  of  philo¬ 
sophy  is  the  whole  of  all  reality  and  the  ability  to  know 
it.  So  it  is  impossible  to  escape  from  conditions.  Con¬ 
ditional  propositions  are  a  basic  intellectual  element  more 
so  than  so-called  "unconditioned"  boolean  propositions. 

Another  way  to  say  this  is  that  all  information  has 
context,  which  is  just  another  word  for  conditions  or 
premises.  So  in  combining  information  with  differing 
contexts,  those  contexts  must  be  carried  along  with  the 
consequent  action  (or  equivalent  proposition)  that  together 
form  the  premise-conclusion  pair.  This  is  a  fundamental 
unit  of  thought,  every  proposition  is  really  such  a  pair. 
When  a  set  of  propositions  has  a  common  context  we  tend 
to  drop  the  context  and  represent  just  the  consequent 
proposition.  Thus  an  infinite  regress  is  avoided.  In  the 
final  analysis  all  information  has  a  common  context, 
namely  the  whole  universe  of  mind  &  matter  in  which 
that  information  fits  and  has  its  meaning. 

Truth,  Implication  and  Uncertainty.  Now 
matters  are  well  understood  as  long  as  conditional 
propositions  are  evaluated  as  being  "true"  according  as 
they  are  true  in  all  cases,  but  false  if  even  one  counter¬ 
example  can  be  found,  one  case  in  which  the  antecedent  is 
true  but  the  consequent  is  false.  A  conditional  "if  b  then 
a"  is  wholly  true,  true  in  all  cases,  a  theorem,  a  so-called 
tautology,  necessarily  true  -  whenever  every  instance 
(case)  of  b  is  also  an  instance  of  a.  In  symbols,  b  <  a. 
Letting  a  mean  "and",  v  mean  "or"  and  mean  "not",  this 
can  also  be  said  in  several  other  equivalent  ways:  "In  all 
cases  either  proposition  a  is  true  or  proposition  b  is 
false";  that  is,  a  v  ~>b  =  1 .  Equivalently,  aAb  =  b,  avb 
=  a,  -~a  a  b  =  0. 


But  when  some  uncertainty  is  introduced,  the  subtle 
difficulties  that  arise  become  almost  insuperable.  For 
instance,  while  standard  probability  theory  has  quite 
adequately  quantified  the  partial  truth  in  "unconditioned" 
statements  residing  in  any  measurable  boolean  algebra,  the 
treatment  of  conditionals  in  standard  probability  theory  is 
decidedly  rudimentary.2  There  is  only  a  "conditional 
probability",  P(alb),  of  "a  given  b”.  which  is  the  ratio  of 
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Those  who  have  made  serious  attempts,  with  some  success,  ’icatioo 

to  algebraically  develop  conditionals  include  G  Boole  [2],  B.  - - 

Russell  [18],  B.  De  Finetti  [8],  Ernest  Adams  [1],  G.  Schay 
[19],  D.  Lewis  [14],  N.  Nilsson  [15],  P  Calabrese  [3],  T  7  — 

Hailperin  [10]  and  I.R.  Goodman,  H  T  Nguyen  &  E.  Walker  ul'or1« 
[12]  and  others,  but  the  list  is  not  long  l  Availabili' 
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the  probabilities  of  the  instances  of  (a  a  b)  to  the 
probability  of  the  instances  of  b.  While  the  conditional 
probability,  P(alb),  is  essential  to  probabilists  for 
quantifying  the  partial  truth  of  (alb),  its  divergence3  from 
P(a  v  -.b)  whenever  P(b)  and  P(alb)  are  both  less  than  1 
dramatizes  the  fact  that  new  distinctions  must  be  made  in 
the  realm  of  conditionals  and  new  structures  identified  as 
soon  as  non-trivial  amounts  of  uncertainty  are  incorpor¬ 
ated.  No  longer  can  the  material  conditional  proposition 
(a  v  -ib)  be  used  to  measure  the  truth  of  a  conditional  (alb) 
in  all  circumstances.4  It  is  also  unfortunate  that  so  many 
authors  refer  to  (a  v  -.b)  as  "material  implication"  when  in 
fact  (a  v  -,b)  is  not  an  implication  -  notwithstanding  that 
it  is  often  expressed  as  ”bz>  a";  rather,  (a  v  -b)  is  a  prop¬ 
osition  or  an  event  By  contrast,  material  implication  is  a 
relation  defined  by  the  boolean  equation  (a  v  -,b  =  1)  or 
equivalently  by  (-*  a  b  =  0).  There  is  all  too  little  expli¬ 
cit  distinction  made  between  absolutely  true  statements 
and  higher  order  statements  of  the  absolute  truth  of  a 
statement.  The  statement  that  proposition  b  is  wholly 
true  is  not  the  same  as  the  (wholly  true)  statement  b.  This 
distinction  is  crucial  when  it  comes  time  to  put  a  separate 
probability  measure  on  the  various  possible  deductively 
closed  sets  of  propositions  and  conditional  propositions 
that  may  presently  form  the  assumed  context  of  some 
other  set  of  uncertain  propositions  &  conditional  proposi¬ 
tions.  This  is  appropriate  in  2-stage  experiments  or  in 
other  time-indexed  information  processing. 

An  Algebra  of  Conditional  Information.  Unlike 
the  standard  probability  theory  of  unconditioned 
propositions,  there  is  no  standard  algebra  of  conditionals 
by  which  to  manipulate  or  simplify  a  complex  expression 
involving  conditionals  before  performing  a  probability 
calculation.  Some  researchers  believe  that  it  is 
unnecessary  to  combine  conditionals  directly  by  "and"  and 
"or".  Others  see  no  need  for  iterated  (nested)  conditionals. 

Yet,  even  so  common  a  parental  admonition  as  "In  the 
rain  wear  a  raincoat,  and  when  it's  cold  wear  warm 
clothes"  is  already  a  conjunction  of  two  conditionals 
which  can  be  violated  in  either  of  two  ways  and  satisfied 
by  wearing  an  insulated  raincoat  in  cold  rainy  weather. 
When  relationships  exist  between  the  components  of 
conditionals,  it  can  hardly  be  right  to  ignore  them  in  our 
logical  and  probabilistic  calculations.  Furthermore,  nested 
conditionals  are  essential  for  deduction  between 


3  This  difference  is  P(a  v  -.b)  -  P(a)b)  =  (1  -  P(b))(l  -  P(alb)) 
as  expressed  by  P.  Calabrese  [3]  in  1974. 

4  D.  Lewis  f  14]  early  showed  that  (alb)  could  not  be  assigned 
the  probability  P(alb)  and  also  be  an  element  of  the  original 
boolean  algebra  containing  a  and  b  because  then  P(alb)  = 
P((alb)  a  a))  +  P((alb)  a  -ia))  =  P((alb)  I  a)  P(a)  +  P((alb)  I  -.a)  = 
...  =  l  x  P(a)  +  0  x  P(-<a)  =  P(a),  no  matter  what  (except  for 
trivial  cases)  the  propositions  a  &  b. 


conditionals  and  for  conditioning  upon  new  possibly 
conditional  information.  Finally,  we  all  routinely  use 
such  constructions  as  "if  whenever  you  walk  across  the 
street  you  look  both  ways,  then  you  will  cross  safely." 

In  this  regard  E.  Adams  [1]  has  posed  an  interesting 
example:  An  object  of  unknown  color  may  be  red  (r), 
yellow  (y)  or  blue  (b)  with  equal  probability.  What  is  the 
new  probability  of  blue  upon  learning  that  'if  the  object  is 
not  red  it  is  blue’?  Thus,  P(  b  I  (b  I  -.r)  )  =  ?  and  more 
importantly,  how  should  this  probability  be  calculated? 

Conditionals  also  arise  naturally  in  expert  systems,  which 
tend  to  be  organized  in  terms  of  "if  -  then  -”  rules. 
Difficulties  arise  when  these  sets  of  rules  are  circular  or 
inconsistent;  simplification  can  identify  and  eliminate 
redundancies  and  inconsistencies  and  can  also  relieve 
computational  complexities.  Conditionals  can  also  be 
useful  in  managing  data  bases,  combining  data, 
facilitating  queries  and  quantifying  partial  truth. 

Overview.  A  section  on  finite  theory,  illustrated  with 
simple  examples,  is  followed  by  an  applications  section 
containing  somewhat  more  elaborate  examples.  The 
theory  section  provides  a  new,  fundamental  approach  to 
conditional  propositions  as  residue  classes  of  boolean 
relations  of  the  form  (b  =  0)  on  an  initially  relation-free 
boolean  polynomial  domain  as  generated  by  a  finite  set  of 
propositions  of  interest.  This  allows  a  new,  unified  devel¬ 
opment  of  conditional  propositions  and  iterated  condition¬ 
als  and  even  deduction  between  conditionals,  as  well  as 
boolean-like  operations  thereon,  as  previously  formulated 
in  Calabrese  [4,5,6].  Conditionals  that  are  themselves 
conditions  for  some  other  proposition  act  like  their  corres¬ 
ponding  material  conditionals.  The  concepts  of  condition¬ 
al  implication  (<c)  and  probabilistically-monotonic  impli¬ 
cation  (<m)  also  reduce  to  boolean  relations  of  this  same 
form  and  thus  are  available  for  building  deductively  closed 
systems  (algebraic  filters)  of  conditional  propositions. 
The  difference  between  P(c)  versus  P(c  =  1 ),  and  between 
P(c  v  -id)  versus  P(c  v  ->d  =  1)  is  algebraically  defined  in 
preparation  for  a  two-stage  experiment  in  which  these 
must  be  distinguished.  Conditional  propositions  are  com¬ 
bined  by  "and",  "or"  and  "not"  in  a  fresh  account  based  on 
boolean  filter  theory  and  boolean  relations.  The  probabil¬ 
ity  formula  P(a  v  b)  =  P(a)  +  P(b)  -  P(a  a  b)  is  general¬ 
ized  to  conditionals  in  preparation  for  its  later  use  in  a 
communication  link  example.  The  applications  section 
includes  three  expert  system  (if  -  then  - )  rules  that  result 
in  a  circular,  rule-chaining  system  but  which  upon  being 
combined  into  a  single  rule,  need  no  further  chaining, 
thereby  avoiding  an  infinite  do-loop  at  execution  time. 

Finite  Theory  of  Conditional  Information 
The  development  of  conditional  propositions  presented 
here  will  incidentally  avoid  the  triviality  situations 
exhibited  by  D.  Lewis  [14]  by  embedding  the  original 
boolean  propositions  in  a  larger,  generally  non-boolean. 
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space  of  ordered  pairs  of  propositions.  As  exhibited  in  P. 
Calabrese  [3, 4, 5, 6,7],  the  structure  of  these  ordered  pairs 
(conditional  propositions)  allows  extended  operations  of 
conjunction  (a),  disjunction  (v),  negation(-.)  and  condi¬ 
tioning  ( 1 )  in  a  way  that  is  non-trivially  consistent  with 
assigning  each  conditional  proposition  or  event,  (alb)  the 
probability  P(alb);  the  probability  that  (alb)  is  a  tautology 
may  have  some  other  probability. 

Besides  the  ordered-pairs  construction,  conditional  proposi¬ 
tions  can  be  algebraically  defined  in  several  other  equiva¬ 
lent  ways  and  there  is  also  more  than  one  plausible  (non¬ 
equivalent)  way  to  define  operations  on  the  resulting  cond¬ 
itionals.  For  an  overview  of  these  approaches  see  D. 
Dubois  &  H.  Prade  (9]  and  I.  R.  Goodman  et  al  [12,13]. 
The  approach  here  is  consistent  in  spirit  with  earlier  work 
of  G.  Schay  [19]  and  E.  Adams  [i],  but  also  differs 
extensively  in  content  and  emphasis.  This  is  still  a 
young  topic. 

Propositions  and  Events 

For  ease  of  application,  especially  in  computers,  we 
restrict  attention  to  the  finite  case.  Let  S  be  a  finite  set  of 
n  propositions  aj,  a2,  ....  an  of  interest.  Assume  that 
none  of  the  negations,  -tai,  of  these  a,  appears  elsewhere 
in  the  list  of  n  initial  propositions.  In  order  to  transcend 
the  domain  of  discourse  S,  form  S(xi,  x^,  ....  xn),  the 
free  boolean  algebra  of  polynomials  generated  by  these  n 
propositions  (when  taken  as  relation-free  variables)  under 
negation  (-0,  conjunction  (a),  and  disjunction  (v).  Thus 
B  is  the  set  of  all  finite  disjunctions  of  conjunctions  of  n 
free  propositional  variables  xi,  X2, ....  xn  or  their  nega- 
ions.  An  arbitrary  element  of  B  is  the  finite  disjunction 
of  any  subset  of  (non-zero)  atomic  propositions  to  of  the 
form  eixi  a  e2X2  a  ...  a  enxn  where,  for  i  <  n,  each  eix, 
is  either  xi  or  -iXi.  Let  ft  be  the  set  of  all  such  atoms  ©. 
ft  has  2n  elements  and  so  B ,  which  corresponds  to  the 
collection  of  all  subsets  of  atoms  of  ft,  contains  2  raised 
to  the  power  2n  propositions.  The  propositions  of  B  also 
correspond  to  the  collection  of  all  possible  ordered  2n  - 
tuples  of  Os  and  Is.  Alternately,  B  can  be  expressed  as  all 
possible  two-valued  functions  from  ft  to  {0,1}.  In  the 
standard  way  assign  probabilities  to  the  atoms  of  ft  and  so 
to  the  propositions  of  B. 

For  example,  suppose  S  =  [Blue  (b),  Red  (r).  Yellow  (y)} 
represents  the  possible  colors  of  an  object.  Then  ft  =  (b 
a  r  a  y,  -ib  a  r  a  y,  b  a  -ir  a  y,  -ib  a  -j  a  y,  b  a  r  a  — y, 
-ib  a  r  a  -iy,  b  a  -ir  a  -y,  ->b  a  -r  a  -y),  which  will  be 
abbreviated  by  dropping  the  a  symbols  and  writing  b'  for 
-ib.  So  B  contains  all  of  the  28  subsets  of  ft.  Initially 
assign  probability  1/8  to  each  of  the  elements  of  ft. 


With  respect  to  machine  storage  and  computation  time,  an 
n-tuple  of  binary  storage  bits  requires  only  n  ordered 
binary  storage  places  of  computer  memory  and  yet  is 
capable  of  taking  the  value  of  any  one  of  2n  atoms. 
Unfortunately,  a  proposition  requires  an  ordered  2n  -  tuple 
to  be  specified,  corresponding  to  that  subset  of  the  2n 
atoms  which  the  proposition  contains.  For  n  =  10,  that 
requires  a  binary  vector  of  length  1024.  Thus  for 
purposes  of  computational  tractability  it  may  be  necessary 
to  divide  large  numbers  of  propositions  into  groups  of 
related  propositions  with  no  group  having  more  than  a 
specified  maximum.  Alternately,  or  in  addition,  a 
judicious  choice  of  hierarchic  variables  (objects)  can  be 
defined  to  manage  computational  complexity. 

Conditional  Propositions 

The  conditional  propositions  (ordered  pairs  of 
propositions)  will  now  be  generated.  In  the  context  set 
forth  above,  the  process  of  conditioning  a  proposition  c 
upon  another  proposition  b  can  be  performed  by  simply 
setting  to  0  all  atoms  of  c  in  common  with  -ib. 

Conditionalization  Relations  Rb«  More  formally, 
temporarily  fix  an  arbitrary  proposition  b  in  B  and  form 
the  quotient  boolean  algebra  (B/b)  under  the  congruence 
relation  Rb  defined  by  the  boolean  equation  (— .b  =  0),  as 
follows:  For  any  two  propositions  c  and  d  in  B  ,  the 
ordered  pair  (c,d)  is  defined  to  be  in  the  relation  Rb  if  c  and 
d  are  equal  after  setting  to  0  any  atoms  that  they  have  in 
common  with  -,b.  This  amounts  to  specifying  -ib  to  be 
impossible,  which  is  also  to  say  that  b  is  to  be  necessary. 
When  propositions  are  represented  as  two-valued  functions 
from  ft  to  [0,1],  this  is  equivalent  to  setting  b_I(0)  =  {© 
g  ft:  b(©)  =  0}  =  [0],  which  is  a  form  that  easily  extends 
to  conditioning  by  conditionals. 

Clearly,  Rb  is  an  equivalence  relation  and  a  congruence 
relation  (=)  using  the  fact  that  two  propositions  are  equal 
if  and  only  if  they  have  the  same  atoms. 

Denote  by  (B/b)  the  quotient  boolean  algebra  thus  formed, 
and  denote  by  (clb)  the  residue  class  in  (B/b)  containing 
the  proposition  c.5  It  follows  that  (clb)  =  (dlb)  if  and  only 
ifCAb=dAb.  We  interpret  (clb)  as  "c  given  b”  or  "c  in 
case  b  is  true”  or  "c  given  ->b  is  false”  or  as  ”c  modulo 
(-ib  =  0)",  or  "c  modulo  (b  =  1)".  Alternately,  (clb)  is  c 
"given  the  falsity  of  -.b”  or  "c  given  the  truth  of  b”. 

Note  that  with  respect  to  the  set  of  all  atoms  ft,  (clb)  is  a 
two-valued  function  restricted  to  the  atoms  in  b.  (clb) 


^  If  b  is  already  0  before  forming  (B/b)  then  (B/b) 
degenerates  to  the  singleton  (0).  which  is  the  "inconsistent 
boolean  algebra”  in  which  1  =  0. 
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assigns  1  to  those  atoms  in  (c  a  b)  and  0  to  the  atoms  in 
(-.c  a  b).  (clb)  is  undefined  (or  "inapplicable”)  on  the 
atoms  in  -ib  because  the  construction  of  (clb)  assumes  that 
they  are  no  longer  atoms  having  been  set  to  0,  which  is 
not  an  atom.  So,  as  early  pointed  out  by  B.  De  Finetti 
[8],  with  respect  to  the  original  boolean  algebra  B,  a 
conditional  proposition  (clb)  has  three  truth  "states":  1,  0 
and  "undefined”.  For  any  proposition  c,  the  probability  of 
(clb)  is  set  to  P(c  a  b)  /  P(b)  so  that  P(b!b)  -  1 . 

The  congruence  class  containing  c  is  (clb)  =  {x  e  B:  x  a  b 
=  c  a  b)  =  (x  e  B:x  =  (CAb)v  (-.b  a  y)  for  some  y  in 
B)  =  cb  v -,bB,  where  conjunction  has  been  replaced  by 
juxtaposition  and  -it®  =  {— ib  a  y:  y  e  B } . 

The  congruence  class  (bib)  =  {x  e  B:  x  a  b  =  b  a  b}  =  {x 
e  B:  b  <  x)  is  a  boolean  filter,  a  deductively  dosed  set  of 
propositions,  because  the  class  is  closed  under 
conjunction,  and  also  closed  under  boolean  deduction  < 

The  quotient  boolean  algebra  BA)  inherits  a  natural 
deduction  ordering  <  from  B,  namely  that  (alb)  <  (clb)  if 
and  only  if  ab  <  cb. 

The  Conditional  Closure  of  B .  Let  B  /B  denote 
the  set  of  all  conditional  propositions  (alb)  for  arbitrary 
propositions  a  and  b  in  B.  B/B  will  be  called  the 
conditional  closure  of  B.  Note  that  B  is  isomorphic  to 
B/l,  so  that  B/B  includes  a  representative  of  B.  Two 
conditional  propositions  (alb)  and  (dd)  are  equivalent  in 
B/B  if  (b  =  d)  and  (a  a  b  =  c  a  d).  Thus  in  general  (alb) 
=  (a  a  b  I  b),  which  has  been  called  "reduction".  However, 
the  further  extension  of  this  equation  to  cond-itionals  as 
(alb)  I  (cld)  =  [(alb)  a  (cld)]  I  (cld)  is  questionable  since  (alb) 

I  (cld)  can  also  be  rendered  as  (a  I  b  a  (cld)). 

Continuing  the  colored  object  example,  if  it  is  known  that 
the  object  has  exactly  one  of  the  3  colors  then  the  free 
boolean  algebra  B  must  be  conditioned  by  the  information 
that  (br  v  by  v  ry  v  b'r'y')  =  0.  So  Q  becomes  (br'y’, 
b’yr\  b'y’r)  and  B  now  has  23  =  8  non-zero  elements.  In 
that  situation  the  (conditional)  probability  of  each  of  the 
three  non-zero  elements  becomes  (1/8)  /  (3/8)  =1/3. 

Iterated  (Nested)  Conditionals.  The  virtue  of  this 
manner  of  development  of  B/b  via  boolean  relations 
defined  by  a  boolean  equation  (— ib  =  0)  is  that  the 
conditionalization  process  can  be  naturally  extended  to 
iterated  conditional  propositions  of  the  general  form  (alb)  I 
(cld).  While  there  may  be  other  ways  to  perform  iterated 
conditionalization,  the  intention  here  is  to  explore  the 
following  type:  Once  B(xj,  X2,  ....  xn)  has  been 


constructed  to  transcend  the  initial  set  S  of  n  propositions, 
all  subsequent  conditioning  is  applied  to  this  original 
polynomial  boolean  algebra  B.  Thus  all  subsequent 
conditions  are  applied  to  B,  and  hence  to  all  structures 
built  from  B  including  B  /B .  Of  course  if  new 
propositions  are  added  to  the  initial  set  of  propositions  S, 
then  B  must  be  expanded  accordingly. 

Consider  first  the  special  case  ((alb)  I  c).  (alb)  is  the 
proposition  a  with  all  of  its  atoms  in  common  with  -ib 
set  to  0,  and  so  (alb)  I  c  is  the  resulting  proposition  after 
all  of  its  atoms  in  common  with  ->c  are  also  set  to  0.  But 
(-ib  =  0)  and  (-*c  =  0)  if  and  only  if  (— .b  v  -.c  =  0).  So 
((alb)  I  c)  is  the  proposition  a  with  all  atoms  in  common 
with  (-.b  v  -*:)  =  -,(b  a  c)  set  to  0.  So  ((alb)  I  c)  =  (a  I 
(be)).  This  has  been  called  the  "import-export"  principle. 

Iterated  Conditionals  of  the  form  ((alb)  I  c). 
More  formally,  for  any  two  conditional  propositions  (alb) 
and  (glh)  in  B/B,  the  ordered  pair  ((alb),  (glh))  is  defined  to 
be  in  the  relation  Rc  if  (alb)  and  (glh)  are  equal  after 
setting  to  0  any  atoms  that  their  components  a,  b,  g  and  h 
have  in  common  with  -ic.  Clearly  this  forms  an 
equivalence  relation  on  B/B.  It  easily  follows  that  mod¬ 
ulo  R«  each  conditional  proposition  (alb)  is  equivalent  to 
(aclbc),  which  is  equivalent  to  (albc).  Thus  ((alb)  I  c)  =  ((a 
I  be)  I  c),  which  is  just  (a  I  be)  because  -ic  <  -i(bc)  and  so 
die  atoms  of  -ic  are  already  zeroed  as  a  part  of  the  atoms  of 
-<bc)  being  zeroed.  It  also  follows  that  Rc  is  a  congruence 
relation  on  B/B  because  if  (alb)  =  (glh),  and  so  b  =  h  and 
ab  s*  gb,  then  be  =  he  and  abc  =  ghe,  and  so  (albc)  =  (glhc). 

Iterated  conditionalization  by  successive  propositions 
amounts  to  setting  additional  atoms  in  £1  to  0  and  thus  the 
process  stays  inside  B/B,  which  contains  all  possible  such 
zeroing  of  atoms. 

Another  way  to  formally  derive  that  ((alb)  I  c)  =  (a  I  be)  is 
via  the  algebraic  congruence  class  representation: 
((alb)  I  c)  =  ((ab  v  ->bB)  1  c)  =  (ab  v  -,bB)c  v  ->cB  = 
abc  v  -ibcB  v  -,cB  =  abc  v  (-,bc  v  -c)B  =  abc  v 
(->b  v  -c)B  =  abc  v  -<bc)B  =  (abc  I  be)  =  (albc). 

Iterated  Conditionals  of  the  form  (a  i  (cld)).  To 
say  "given  (cld)”  or  "in  case  (cld)"  is  to  assume  the  non¬ 
violation  of  (cld).  Since  (cld)  is  violated  on  (-,c  a  d),  this 
amounts  to  setting  to  zero  all  atoms  of  proposition  a  in 
common  with  (-*;  a  d).  So  (a  I  (cld))  =  (a  I  (c  v  — ,d)) 
because  conditioning  by  (c  v  -d)  has  the  same  result. 

Conditionalization  Relations  R(cid)-  More 
formally,  for  any  two  propositions  a  and  b  in  B ,  the 
ordered  pair  (a,b)  is  defined  to  be  in  the  relation  R(Cid)  if  a 
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and  b  are  equal  after  setting  to  0  any  atoms  that  they  have 
in  common  with  (dd)'l(0),  which  is  the  set  of  atoms  for 
which  (dd)  is  violated.  As  in  the  simple  conditional  case, 
this  means  that  we  must  set  (cldJ'^O)  =  {0}.  However, 
(cld)*l(0)  =  (to  e  Q:  to  is  an  atom  of  (-iC  Ad)}.  So  the 
atoms  of  (— iC  a  d)  are  just  the  atoms  to  be  set  to  zero,  and 
thus  (a  I  (dd))  =  (a  I  (c  v  -d)  since  (c  v  -d)  =  ->(-<:  a  d). 
It  is  easy  to  see  that  R<cid)  >s  a  congruence  relation  on  B . 

For  example,  returning  to  E.  Adam's  colored  object 
example,  upon  learning  that  "if  the  object  is  not  red  then 
it  is  blue",  the  new  (conditional)  event  for  "blue  (b)"  is 
(b  1  (b  I  —it))  =  (b  I  (b  v  r)),  and  the  new  probability  for  b  is 
P(b  1  (b  v  r))  =  P(b)  /  P(b  v  r)  =  (1/3)  /  (2/3)  =  1/2. 

Iterated  Conditionals  of  the  form  ((alb)  I  (cld). 
On  the  face  of  it,  by  observing  the  arrangement  of 
parentheses,  ((alb)  I  (cld)  is  a  conditioning  of  proposition 
a  by  proposition  b  followed  by  a  conditioning  of  the 
result  by  (dd).  Thus  ((alb)  I  (dd)  is  proposition  a  after  its 
atoms  in  common  with  ->b  are  set  to  zero  and  then  after 
its  atoms  in  common  with  (-c  a  d)  are  also  set  to  zero, 
which  is  just  {(a  I  b)  I  (c  v  — d)]  =  (a  I  (b  a  (c  v  -d))),  and 
the  latter  conditional  proposition  is  in  S/B.  Thus  B/B  is 
closed  under  such  iterated  conditioning.  We  have  then  that 

((alb)  I  (cld)  =  (a  I  (b  a  (c  v  -nd)))  (1) 

Boolean  Relations.  The  conditional  closure  B/B  can 
be  conditioned  by  any  boolean  relation  which  is 
expressible  in  the  form  "b  =  0”.  These  allow  known  rela¬ 
tionships  between  the  propositions  of  B  such  as  "c  <  d", 
"cd  =  0",  and  "cvd=  1"  and  similar  relations  on  condi¬ 
tionals  to  be  incorporated  in  terms  of  conditionalization. 

This  also  opens  up  connections  between  this  theory  of 
conditional  event  algebra  /  conditional  probability 
(CEAPL)  and  the  work  of  J.  Hooker  [1 1]  concerning  tech¬ 
niques  for  solving  systems  of  boolean  equations;  in  many 
situations,  a  set  of  these  boolean  equations  will  need  to  be 
simultaneously  solved  in  order  to  "define  the  condition"  as 
specified  by  a  set  of  known  boolean  relationships  between 
specified  propositions  and  conditional  propositions. 

Deduction 

Since  a  conditional  proposition  (cld),  when  acting  itself  as 
a  conditional,  is  equivalent  to  its  corresponding  material 
conditional  (c  v  ->d),  a  collection  (ailbi),  (a2lb2),  (a3lb3), 
....  (aotlbm)  of  conditionals  generates  an  algebraic  filter, 
namely  the  deductively  closed  system  generated  by  the 
proposition  (aj  v  — .bi)  a  (a2  v  -ib2)  a  (a3  v  -,b3)  a  ...  a 
(am  v  — <bm).  In  particular,  given  one  conditional  (alb),  a 
second  conditional  (cld)  is  also  deductively  "given"  (that 
is,  a  tautology)  if  and  only  if  (a  v  -ib)  <  (c  v  -d).  This 


kind  of  implication  between  conditionals  has  been  called 
conditional  implication  (<c).  As  defined  in  [6], 

(alb)  <c  (cld)  if  and  only  if  (a  v  — ,b)  <  (c  v  -id).  (2) 

(alb)  <c  (cld)  means  that  "if  (alb)  is  not  false  then  (cld)  is 
not  false."  Equivalently,  it  means  that  “if  (cld)  is  false 
then  (alb)  is  false.” 

Note  that  with  respect  to  probability,  conditional 
implication  (<c)  only  ensures  that  P(-icd)  <  P(-rab)  or 
equivalently  that  P(a  v  — >b)  <  P(c  v  -4)  not  that  P(alb)  < 
P(cld),  which  is  stronger,  in  general  requiring  also  that  ab 
<  cd.  When  both  (alb)  <c  (cld)  and  ab  <  cd  hold,  the 
implication  has  been  called  monotonic  implication  by  the 
author  [6]  since  it  is  probabilistically  monotonic.6  The 
conditional  implication  (alb)  <c  (cld)  can  be  expressed  by 
the  boolean  equation  (-icd)(a  v  ->b)  =  0  and  monotonic 
implication  (alb)  <m  (cld)  can  be  expressed,  for  instance, 
by  the  equation  (-cdKa  v  — .b)  v  (abWcd)  =  0. 

For  Adams  [1],  (alb)  can  be  "validly  inferred"  from  (cld)  if 
and  only  if  (cld)  has  high  conditional  probability  whenever 
(alb)  has  high  conditional  probability,  and  this  is 
equivalent  to  monotonic  implication,  but  Adams 
generalizes  by  allowing  the  premise  to  be  any  finite 
conjunction  of  conditionals. 

Two  conditionals  (alb)  and  (cld)  are  said  to  be  conditionally 
equivalent  (=c),  that  is,  equivalent  as  conditions,  if  and 
only  if  (a  v  — .b)  =  (c  v  -d). 

If  (alb)  <c  (cld)  then  (cld)  I  (alb)  is  a  tautology  because  (dd) 
I  (alb)  =  (c  I  d  a  (a  v  -ib))  =  (c  I  d  a  (c  v  -id))  =  (c  I  d  a  c)  = 
(d  a  c  I  d  a  c)  =  (1  I  d  a  c).  Conversely,  if  (cld)  I  (alb)  is  a 
tautology  then  (a  v  ~>b)  <  (c  v  -id)  because  then  (cld)  I  (alb) 
=  (lid)  I  (alb)  and  so  cd(a  v  -,b)  =  d(a  v  -.b),  which  means 
d(a  v  -ib)  <  c,  which  means  (a  v  -ib)  <  (c  v  -id). 

Note  that  since  (c  v  -id)  =  -id  v  -i(-ic),  a  conditional 
proposition  (cld),  "if  d  then  c",  is  conditionally  equivalent 
to  its  contrapositive  (-d  I  -ic),  "if  not  c  then  not  d".  (cld) 
=c  (-id  I  -c)  because  (d  <  c)  if  and  only  if  (-ic  <  -d).  This 
is,  after  all,  a  fact  of  everyday  mathematical  life  (except 
for  those  who  resolutely  abhor  all  proofs  by 
contradiction.)  However,  (cld)  *  (-id  I  -ic)  and  P(cld)  * 
P(-id  I  -ic)  except  in  very  special  cases. 

Since  by  definition  every  deductively  closed  set  D  contains 
(alb)  a  (cld)  whenever  it  contains  both  (alb)  and  (cld),  a  set 


6  I.R.  Goodman  [12]  first  showed  (hat  (bis  implication 
relation  between  conditionals  is  also  the  weakest  which  is 
probabilistically  monotonic  for  all  probability  measures 
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of  m  conditionals  deductively  entails  all  2m  of  its  subset 
conjunctions.  D  also  contains  (cld)  whenever  it  contains 
(alb)  and  (alb)  ^x  (cld)  bolds,  where  relation  <x  is  <c,  <m, 
or  some  other  boolean  deduction  relation  in  the  sense  that 
it  extends  ^  on  B/b  for  every  b  e  B.  That  is,  (alb)  <x 
(clb)  if  and  only  if  ab  £  cb.  Furthermore,  modus  ponens 
is  also  valid  in  the  conditional  closure  in  the  sense  that 
(alb)  a  ((cld)  I  (alb)]  =  (alb)  a  (cld). 

All  this  reduces  the  problem  of  deduction  in  the  realm  of 
conditionals  to  the  well-understood  boolean  deduction, 
much  like  the  system  of  N.  Nilsson  [15],  in  which  all 
conditionals  (alb),  not  just  those  acting  as  conditions,  are 
reduced  to  their  corresponding  material  conditionals  (a  v 
-ib).  In  this  context,  probabilities  of  conditionals  (cld)  are 
probabilities  of  their  corresponding  material  conditionals 
(a  v  -ib)  since  both  are  false  on  the  same  set  of  atoms, 
namely  those  of  (-.a  a  b).  That  is,  P(  (alb) )  =  P(a  v  -ib). 

Tautologies.  In  view  of  the  above,  a  probability  can  be 
either  of  the  following  two  important  types: 

1)  Probability  that  a  proposition  is  true,  and 

2)  Probability  that  a  proposition  is  a  tautology. 

For  a  proposition  c,  these  types  of  probability  are 

1)  P(c)  =  P{gi  e  Cl:  ©  is  an  atom  of  proposition  c} 

=  P(c-1(l)),and 

2)  P((c))  =  P(proposition  c  is  a  tautology)  =  P(c  =  1) 

=  P(principle  filter  generated  by  (c)  is  given). 

For  conditionals,  the  first  type  is  P(cld)  =  P(cd)  /  P(d)  = 
P((cd)'1(l))  /  P(<J~1(1)),  called  the  conditional  probability, 
while  the  second  type  is  P((cld)  is  a  tautology)  =  P(d  <  c) 
=  P(c  V  -rd  =  1). 

Again,  in  the  second  type,  P(  (cld)  =  (lid) )  =  P(  (c  v  -id)  = 
1 ),  as  in  the  standard  logical  reduction,  because  (cld)  is  a 
tautology  if  and  only  if  (c  v  -id)  is  a  tautology.  In  the 
first  type,  P(cld)  <  P(c  v  -d)  except  when  P(d)  or  P(cld)  is 
1.  Not  noticing  this  distinction  has  been  a  great  source  of 
confusion  between  logicians  and  probabilists. 

This  requires  P  to  be  extended  to  the  space  of  filters  of  B, 
which  form  a  boolean  algebra.  For  filters  J  and  K, 

(J  v  K)  =  (a  v  b:  a  e  J.  b  e  K)  =  J  n  K 

(J  a  K)  =  {a  a  b:  a  e  J,  b  e  K)  2  J  u  K 

J'  =  {b  g  B:  -ia  <  b,  for  all  a  e  J} 

(0)  =  B 
(1)  =  (I) 


Then  J  a  J'  =  (0),  J  v  J*  =  (1),  and  (JT  =  J.  This 
allows  P  to  apply  also  to  the  likelihood  of  tautologies, 
which  is  different  from  the  likelihood  of  propositions. 
Deduction  between  filters  "if  J  is  a  tautology  then  K  is  a 
tautology"  has  been  called  necessary  implication  (<n),  the 
"entailment  of  necessity":  if  J  =  (1)  that  K  =  (1).  In 
general  (alb)  <n  (cld)  means  that  (b  <  a)  entails  (d  <  c). 

Other  Operations  on  Conditionals 
As  mentioned  previously,  operations  of  negation  (— i), 
conjunction  (a)  and  disjunction  (v)  have  been  plausibly 
defined  on  the  conditionals  of  B/B  in  more  than  one  way. 
I.  R.  Goodman  [12]  has  shown  that  choosing  such  an 
operation  on  B/B  corresponds  to  choosing  a  3-valued  truth 
table  on  B/B  and  conversely.  Here,  the  main  considera¬ 
tion  is  to  be  consistent  with  probability  while  developing 
an  algebra  of  logical  conditionals  which  is  consistent  with 
common  usage.  The  spirit  follows  the  development  of 
conditionals  by  E.  Adams  [1]  although  it  also  differs 
significantly.  For  instance,  recently  Adams  has  focused  on 
meta-conditionals  like  P(  P(cld)  =  1) ),  the  probability  that 
c  is  highly  likely  given  d.  This  is  a  generalization  of 
P((cld)  =  1),  the  probability  that  (cld)  is  a  tautology. 

Negation.  Within  any  quotient  boolean  algebra  B/b, 
each  conditional  proposition  (alb)  has  a  negation  ->(alb), 
which  is  just  (-alb)  because  (alb)  v  (-alb)  =  (lib),  and  (alb) 
a  (-alb)  =  (01b),  where  (lib)  and  (01b)  are  the  I  and  0  re¬ 
spectively  of  B/b.  Furthermore,  P(alb)  +  P(-a!b)  =  1.  So 
it  is  natural  to  keep  these  conditional  negations  in  B/b: 

— Kalb)  =  (—alb)  (3) 

However  (alb)  and  (-alb)  are  not  negations  with  respect  to 
each  other  in  B/B  except  in  the  case  that  b  =  1.  The 
negation  of  a  (principal)  filter  generated  by  the  single 
proposition  b  is  (-.b)  =  fxeB:-,b<x]  and  the 
congruence  class  containing  -a  is  (-alb). 

Disjunction.  Intuitively  speaking,  the  compound 
conditional  "if  b  then  a,  or  if  d  then  c",  in  symbols  (alb)  v 
(cld),  is  true  in  case  either  conditional  is  true,  and  it  is 
defined  (or  "applicable")  in  case  either  antecedent,  b  or  d, 
is  true.  If  (alb)  v  (cld)  is  defined  but  not  true  then  it  is 
false.  Since  (alb)  is  true  on  ab  and  (cld)  is  true  on  cd,  it 
follows  that 

(alb)  v  (cld)  =  (ab  v  cd  I  b  v  d).  (4) 

More  formally,  the  deductively  closed  set  (or  filter) 
associated  with  the  conditional  proposition  (alb)  is  (b)  = 
{x  £  B:  b<  x}  and  that  of  (cld)  is  (d)  =  {x  e  B:  d  <  x}.  If 
either  of  these  two  deductive  systems  is  "given",  then  the 
resulting  deductive  system  that  is  "given”  necessarily  in- 
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eludes  only  those  propositions  that  are  common  to  (b)  and 
(d).  But  the  largest  filter  included  in  both  filters  (b)  and  (d) 
is  (b)  n  (d)  =  (b)  v  (d)  =  (b  v  d).  So  (alb)  v  (ckl)  is  defined 
on  (b  v  d).  Within  this  domain  of  definition,  (alb)  v  (cld) 
is  true  on  (ab  v  cd).  So  equation  (4)  follows.  Alternately, 
if  all  atoms  in  common  with  ->b  are  set  to  zero,  or  all 
atoms  in  common  with  -d  are  set  to  zero,  then  the  atoms 
that  are  actually  set  to  zero  in  either  case  are  just  those  in 
common  with  (— .b  a  -id).  But  (->b  a  -d)  =  0  if  and  only  if 
->(b  v  d)  =  0.  So  again,  (alb)  v  (cld)  is  defined  on  (b  v  d). 

Conjunction.  As  in  the  case  of  disjunctions,  (alb)  a 
(cld)  is  defined  when  either  (alb)  or  (cld)  is  defined,  that  is, 
on  (b  v  d).  Within  its  domain  of  definition,  (alb)  a  (cld)  is 
violated  if  and  only  if  either  (-ia  a  b)  v  (-ic  a  d)  is  true. 
Otherwise,  on -i[(-ia  a  b)  v  (— «c  a  d)j  =  (a  v  -ib)(c  v  -d), 
it  is  not  violated.  If  it  is  defined  and  not  violated,  then  it 
is  true.  It  follows  that 

(alb)  a  (cld)  =  (av-,bXcv-d)l(bvd)  (5) 

=  (abed  v  ab-d  v  -ibed)  I  (b  v  d) 

=  [(abXcv  -d)  v  (av-ibXcd)]l(bvd). 

Note  that  the  right  hand  side  of  equation  (5)  reduces  to  that 
of  equation  (4)  when  ab<(cv  -d)  and  (cd)  <  (a  v  — ib). 
These  are  equivalent  to  saying  (ab)(-«d)  =  (-ab)(cd)  =  0. 
A  sufficient  condition  for  this  is  that  bd  =  0,  that  b  and  d 
are  disjoint.  So  in  these  situations  the  conjunction  of 
conditionals  may  be  equivalent  to  the  disjunction  of  cond¬ 
itionals  without  the  component  conditionals  being  equal. 

The  conjunction  of  two  conditionals  can  also  be  derived 
by  requiring  that  the  De  Morgan  formulas  should  hold  in 
B/B:  In  that  case  (alb)  a  (cld)  =  — .[(— ialb)  v  (-cld)]  = 
-i[(-iab  v  -cd)  I  (b  v  d)]  =  (a  v  -ib)(c  v  -d)  I  (b  v  d), 
which  reduces  to  (abed  v  ab-d  v  -,bcd)  I  (b  v  d). 
Similarly,  equation  (4)  can  be  derived  from  equation  (3) 
and  the  De  Morgan  formula.  Alternately,  the  De  Morgan 
formulas  can  be  proved  from  equations  (4)  and  (5). 

The  disjunction  and  conjunction  of  conditionals  consti¬ 
tutes  a  combining  of  partially  applicable  information,  as 
when  data  bases  are  built  up  from  overlapping  pieces  of 
information  from  different  sources  or  circumstances. 

For  example,  consider  the  familiar  experiment  of  rolling  a 
single  die  once  with  atoms  £2  =  (1,  2,  3,  4,  5,  6}.  B  is 
the  collection  of  subsets  of  £2.  A  wager  is  made  that  "if 
the  roll  is  <3  then  it  will  be  odd  and  if  the  roll  is  even 
then  it  will  a  6."  This  can  be  represented  as  (odd  I  S3)  a 
(6 1  even)  =  (0  v  (odd)(<3)(->(even))  v  -.(<3X6Xeven))  I 
(<3  v  even)  =  ({l,3}v{6}  I  {1,2, 3,4, 6))  = 


{1,  3, 6}  I  { I,  2,  3, 4,  6).  The  conditional  probability  is 
therefore  3/5,  which  can  also  be  determined  by  checking 
individual  atoms. 

According  to  these  operations,  the  conjunction  of  two 
conditionals  (alb)  and  (cld)  conditionally  implies  either  of 
the  conjuncts.  That  is,  (alb)  a  (cld)  <c  (alb).  However, 
the  disjunction  of  two  conditionals  is  not  in  general 
conditionally  implied  by  each  of  its  disjuncts.  That  is, 
(alb)  <c  (alb)  v  (cld)  does  not  hold  in  general,  requiring  in 
addition  that  ->cd  <  b,  namely  that  if  (cld)  is  false  then 
(alb)  is  defined.  Nevertheless,  (alb)  a  (cld)  <  (alb)  v  (dd) 
since  both  are  in  B/(b  v  d).  In  this  regard,  it  turns  out 
that  except  for  special  cases,  we  generally  have7 

(alb)  a  (cld)  <  (alb)  v  [(alb)  a  (cld)]  <  alb)  v  (cld) 

where  the  middle  expression  is  also  in  B/(b  v  d).  This 
middle  expression  is  true  whenever  (alb)  is  true,  even  if 
(cld)  is  false,  or  when  (alb)  is  undefined  and  (cld)  is  true, 
but  it  is  violated  when  (alb)  is  false  and  (cld)  is  true. 
Similarly  for  (alb)  a  [(alb)  v  (cld)]. 

Note  that  this  property,  for  example,  allows  (01b)  a  1  =  b 
to  have  larger  probability  in  general  than  (01b),  which 
must  have  probability  0.  Conjunction  and  disjunction  of 
conditionals  does  not  here  preserve  probability  in  the 
same  way.  (01b)  is  zero  only  in  B/b.  Outside  b,  it  has  no 
affect;  it  is  "inapplicable”.  Some  of  the  results  of  the 
{seceding  pages  can  be  summarized  as  follows: 

Theorem:  Let  S  be  any  finite  set  {ai,  a2,  ....  an]  of  n 
boolean  propositions,  no  two  of  which  are  negations  of 
one  another,  and  let  P  be  a  probability  assignment  to  the 
2n  (atomic)  elements  of  £2  =  {all  conjunctions  of  the  n 
members  of  S  or  their  negations}.  Then  the  relation-free 
boolean  algebra  B  generated  by  n  free  boolean  variables 

X],  X2 . xn  has  a  conditional  closure  B/B  that  includes 

an  isomorphic  copy  of  the  boolean  algebra  generated  by  S; 
furthermore,  B/B  is  closed  under  the  operations  of 
conjunction  (a),  disjunction  (v),  negation  ( — .)  and 
conditionalization  ( I )  according  to  equations  (3),  (4) ,  (5) 
and  (1)  respectively,  and  so  any  such  operation  on  S  is 
represented  in  B/B;  and  finally,  for  all  (alb)  in  B/B, 
P(alb)  =  P(a  a  b)  /  P(b). 

Note  that  a  formal  disjunction  of  the  congruence  classes 
(ab  v  -,bB)  and  (cd  v  -dB)  does  not  result  in  equation  (4), 
but  instead  yields  (ab  v  -bB)  v  (cd  v  -,dB)  =  (ab  v  cd 
v  -bB  v  -dB)  =  (ab  v  cd)  v  ( — >b  v  -d)B  =  (ab  v  cd) 
v  -i(bd)B  =  (ab  v  cd  I  bd),  which  is  the  disjunction 

7  (alb)  <x  (cld)  means  that  (alb)  <x  (cld)  and  (alb)*x  (cld) 
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operation  cbosen  by  G.  Schay  [19]  to  go  with  the  con¬ 
junction  operation  (S).  This  derivation  separates  the  con¬ 
sequent  (ab)  from  its  antecedent  -bB  in  a  way  that  leads  to 
the  conditional  (ab  v  cd  I  bd),  which  is  defined  only  when 
both  antecedents  are  true.  By  similar  development  the 
formal  conjunction  of  the  two  congruence  classes  (ab  v 
-bB)  and  (cd  v  ->dB)  yields  the  conditional  proposition 
(abed  I  bd  v  -.ab  v  -<cd),  which  is  the  conjunction  oper¬ 
ation  favored  by  I.R.  Goodman,  et  al  [13].  This  condi¬ 
tional,  which  is  undefined  outside  (b  v  d),  and  within  (b  v 
d)  is  undefined  on  (ab— d  v  -.bed)  is  the  greatest  lower 
bound  of  (alb)  and  (cld)  with  respect  to  <m,  and  is  also 
conditionally  equivalent  (=c)  to  the  right  hand  side  of  equa¬ 
tion  (S).  The  result  of  the  disjunction  operation  of  Good¬ 
man  et  al  upon  (alb)  and  (cld)  is  the  conditional  (ab  v  cd  I 
ab  v  cd  v  bd),  which  is  the  least  upper  bound  with  respect 
to  and  is  also  conditionally  equivalent  to  Schay's 
disjunction  mentioned  above.  When  b  and  d  are  disjoint 
thii  motion  yields  a  tautology,  the  filter  generated  by 
ab  v  cd,  namely,  (1  I  ab  v  cd).  For  instance,  for  the 
compound  "if  heads  comes  up  then  I  win  or  if  tails  comes 
up  then  I  lose"  this  disjunction  yields  the  filter  (and 
congruence  class)  of  all  propositions  entailed  by  the 
proposition  "heads  comes  up  and  I  win  or  tails  comes  up 
and  I  lose".  By  contrast  equation  (4)  yields  the  simple 
proposition  "beads  comes  up  and  I  win  or  tails  comes  up 
and  I  lose". 

Probabilities.  By  taking  probabilities  of  equations  (4) 
and  (5)  for  disjunction  and  conjunction  and  doing  some 
rearranging,  the  following  formulas  for  the  disjunction  and 
conjunction  of  conditionals  are  obtained  in  terms  of 
standard  conditional  probabilities: 

P((alb)  v  (ckl))  =  P(b  I  b  v  d)  P(alb)  (6) 

+  P(d  I  b  v  d)  P(dd)  -  P(abcd  I  b  v  d) 

P((alb)  a  (cld))  =  P(b  I  b  v  d)  P(a^dJb)  (7) 

+  P(d  I  b  v  d)  P(c-ibld)  +  P(abcd  I  b  v  d) 

Note  that  the  last  term  in  these  two  formulas  can  be 
written  as  P(bd  lb  vd)  P(ab  I  bd).  An  important  special 
case  of  these  formulas  occurs  when  ab  <  —id  and  cd  <  — ib, 
like  when  b  and  d  are  disjoint.  Then  they  both  reduce  to: 

P((alb)  v  (cld))  =  P((alb)  a  (cld))  =  (8) 

P(b  I  b  v  d)  P(alb)  +  P(d  I  b  v  d)  P(cld). 

More  generally,  if  bi,  b2,  b3 . bm  are  pairwise  disjoint 

and  denoting  (bi  v  b2  v  b3  v  ...  v  bm)  by  b,  then 

P((ailbt)  v  (a2lb2)  v  ...  v  (anlbm))  =  (9) 

P(b  i  lb)P(a  i  lb  i )  +  P(b2lb)P(a2lb2)  +...  +  P(bmlb)P(amlbm) 


Applications 

A  probabilistically  faithful  algebra  of  conditional 
propositions  will  be  useful  for  analyzing  complex 
conditional  propositions,  for  simplifying  expert  system 
rules,  for  analyzing  iterated  (nested)  conditional 
constructions,  for  combining  and  simplifying  data  bases, 
and  for  analyzing  common  natural  language  constructions. 
In  the  following  paragraphs  somewhat  more  elaborate 
examples  of  these  will  be  illustrated. 

Complex  Conditional  Propositions 

The  disjunction  or  conjunction  of  conditionals  naturally 

arises  or  is  useful  in  various  circumstances. 

Communication  Link  Example.  Suppose  a 
military  planner  is  estimating  the  conditional  probability 
that  both  communication  links,  L]  and  L2,  that  are 
connected  in  series,  will  survive  if  they  are  attacked.  The 
planner  therefore  wants  to  calculate  the  probability  that  "if 
Li  is  attacked  (b)  or  L2  is  attacked  (d)  then  both  Li  will 
survive  (a)  and  L2  will  survive  (c)."  This  is  just  the 
conditional  (ac)  I  (b  v  d).  Now 

(ac)l(bvd)  =  (ac)  I  (bd  v  -bd  v  b-d) 

=  (ac  I  bd)  v  (ac  I  -.bd)  v  (ac  I  b-d), 

and  so  by  equation  (9), 

P((ac)  I  (b  v  d))  =  P(bd  I  b  v  d)  P(ac  I  bd) 

+  P(-bd  I  (b  vd))  P(ac  I  -.bd) 

+  P(b-d  I  (b  v  d))  P(ac  I  b-d). 

Since  (a  I  ->b)  =  (1  I  -.b),  therefore  (a  =  1)  I  -b.  So  (ac  I 
-bd)  =  (ac  I  -b)  I  d  =  (c  I  -b)  I  d  =  (c  I  -.bd)  =  (cld)  I 
(-.b).  If  the  survival  of  one  link  is  (conditionally) 
independent  of  attack  on,  or  survival  of,  the  other  link  (a 
la  J.  Pearl  [17]),  then  P(  (cld)  I  (-.b) )  =  P(cld).  So  P(ac  I 
-bd)  =  P(cJd).  Similarly  P(ac  I  b-d)  =  P(alb)  and  P(ac  I  bd) 
=  P(alb)P(ckl).  So,  since  P(al-.b)  =  1  *  P(d-d), 

P((ac)  I  (b  v  d))  =  P(bd  I  b  v  d)  P(alb)P(Jd) 

+  P(-.bd  I  (b  v  d))  P(cld) 

+  P(b-d  I  (b  v  d))  P(alb), 

which  is  a  formula  whose  values  can  be  estimated 
according  to  the  importance  of  each  term. 

If  instead,  the  links  are  connected  in  parallel,  then 
communication  will  survive  as  long  as  at  least  one  link 
survives.  So  the  relevant  compound  conditional  is 

(a  v  c)  I  (b  v  d)  =  (a  v  c)  I  (bd)  v  (avc)l  (b-d  v  -.bd) 

=  (avc)l(bd)  v  (1 1  (b-d  v  -bd)). 

The  second  equality  follows  since  one  link  is  sure  to 
survive  if  not  both  are  attacked,  and  so  by  equation  (9), 
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P((a  v  c)  I  (b  v  d))  =  P((a  v  c)  I  bd)  P(bd  i  (b  v  d)) 

+  (1)  P((b-d  v  -.bd)  1  (b  v  d)) 

=  P((a  v  c)  I  bd)  P(bd  I  (b  v  d))  +  [1  -  P(bdl(bvd)] 

=  1  -  P(bd  1  (b v d)) (1  -  P((avc)!fd)J 
=  1  -  P(bd  I  (b  v  d))  P(-a-nc !  bd), 

wbicb  is  1  minus  tbe  probability  that  both  links  will  be 
attacked  given  one  is,  times  tbe  probability  that  both 
links  will  not  suivive  given  both  are  attacked.  If  link 
non-survivali  are  conditionally  independent,  then 
PWavcJKovd))  «  1  -  P(bd  I  (b  v  d))  P(^aJb)P(-cld). 

Expert  System  Example.  Tin  A.  Nguyen  et  al  [16] 
have  given  a  nice  account  of  tbe  many  difficulties  arising 
in  expert  rule  systems  including  circularity  and 
consistency  problems  when  a  list  of  rules  is  left 
uncombined  and  unsimplified  before  chaining  of  the  rules 
is  attempted.  Such  a  conjunction  of  “if  -  then  -  "  rules 
can  cause  a  computer  to  execute  an  infinite  do-loop.  For 
example,  on  page  72  they  give  an  diagnostic  example  of 
three  rules  with  a  circularity  problem:  Denote  the 
following  propositions  as  indicated: 

T  =  proposition  that  patient  has  a  temperature  >  100°  F 
F  =  proposition  that  patient  has  a  fever 
S  =  proposition  that  patient  has  flat  pink  spots 
M  =  proposition  that  patient  has  measles. 

Rule  1:  If  T  tbenF 

Rule  2:  If  (Fand  S)  thenM 

Rule  3:  If  M  then  T 

Clearly,  if  these  rules  are  chained,  then  rule  1  chains  to 
rule  2,  rule  2  chains  to  rule  3,  and  rule  3  chains  back  to 
rule  1,  and  the  computer  program  may  never  end!  But  if 
these  three  rules  are  first  conjoined  by  equation  (5), 
(FIT)  a  (M  I  FS)  a  (T I  M)  =  (FIT)  a  (T I M)  a  (M  I  FS) 
=  ((FT)  I  (Tv  M))  a  (Ml  FS)  =  FTM-XFS)  v 
(-T-tM)MFS  V  FTMS  I  ( T  V  M  V  FS)  =  FT(-S  v  M)  I 
(T  v  M  v  FS).  So  the  three  rules  are  equivalent  to  a 
single  rule  that  "if  a  patient  has  a  temperature  over  100° 
F,  or  measles,  or  a  fever  and  flat  red  spots,  then  the 
patient  has  a  fever  and  a  temperature  over  100°  F  and 
either  measles  or  no  flat  red  spots."  By  thus  combining 
the  rules,  there  is  no  need  to  chain  and  no  danger  of  an 
infinite  loop.  (Self-chaining  can  be  disallowed.) 

Throughout  the  above  computation  there  is  no 
assumption  that  the  expert  rules  are  wholly  true;  there 
may  be  a  probability  of  each  one  holding,  and  these 
probabilities  are  not  violated  by  the  computations. 
Furthermore,  the  probability  of  any  one,  or  of  the 
conjunction  of  two,  or  of  all  three  of  the  rules  can  be 
calculated  in  principle.  For  instance,  the  probability  of  all 
three  is  P(FT-.S  v  FTM)  /  P(T  v  M  v  FS). 


Complex  Conditional  Example.  Consider  the 
familiar  experiment  of  rolling  a  single  die  once  and 
observing  the  value  r  facing  up  from  1  to  6.  What  is  the 
conditional  event  (CID)  and  conditional  probability  P(C!D) 
that  ((if  3  £  r  £  5  then  r  is  odd)  or  (if  4  <  r  then  r  is 
even)]  and  [if  r  is  even  then  r  <,  4]?  In  conditional 
notation,  [  (odd  I  3  5  r  <  5)  v  (even  1 4  <  r)  ]  a  (  r  <  4  I 
even  ]  =  ?  And  what  is  its  probability  ? 

Solution:  First  ( (odd  I  3  <  r  <  5)  v  (even  I  4  <  r)  ]  = 
[(odd)(3  S  r  <  5)  v  (evenX4  s  r))  I  [(3  <  r  <  5)  v  (4  <  r)) 
=  [(3  v  5)  v  (4  v  6)  I  (  3  <  r  )]  =  [(3  <  r)  I  (3  5  r)]. 
Furthermore,  [(3  <  r)  I  (3  <  r  )]  a  [(r  <  4)  I  even]  = 
[(3  <  r)  v  (3  <  r)']  a  [(r  S4)v  (even)']  I  [( 3  <r )  v  even] 
=  (r  <  5)  I  (2  ^  r)  =  (2  <  r  <;  5)  I  (2  5  r),  which  is  (CID) 
in  reduced  form.  So  P(CID)  =  4/5. 

Example  Distinguishing  P(a)  and  P(a  =  1).  An 
experiment  consists  of  flipping  a  fair  coin  once  and  then 
tossing  either  an  ordinary  6-sided  die  once  if  "heads" 
comes  up  on  the  coin,  or  tossing  a  (4-sided)  tetrahedron 
once  if  "tails"  comes  up  on  the  coin.  The  faces  of  the  die 
are  numbered  1  through  6  and  those  of  tbe  tetrahedron  1 
through  4.  (The  side  of  the  tetrahedron  touching  the  floor 
is  chosen  by  the  toss.)  The  set  of  possible  outcomes  is  Cl 
=  (HI.  H2,  H3,  H4,  H5,  H6,  Tl,  T2.  T3,  T4);  B  is  the 
collection  of  all  subsets  of  Cl,  and  the  probabilities  assign¬ 
ed  to  Q  are  P(Hi)  =  (1/2X1/6)  =  1/12  for  i  =  1  to  6,  and 
P(Ti)  =  (l/2)(l/4)  =  1/8  for  i  =  1  to  4.  Let  H  denote  (HI, 
H2,  H3,  H4,  H5,  H6]  and  let  T  denote  (Tl,  T2,  T3,  T4). 
The  event  (or  proposition)  "<5"  is  (HI,  H2,  H3,  H4,  Tl, 
T2,  T3,  T4)  and  its  probability  is  1  -  2/12  =  5/6.  Note 
that  P(<5  I  H)  =  2/3  and  P(<5  I  T)  =  1.  Furthermore, 
P("<5"  is  certain  I T)  =  P((”<5"  =  1)  I T)  =  1  and  P(("<5” 
=  1)  I  H)  =  0.  So  P(<5  is  certain  after  the  coin-flip)  = 
PT<5"  =  1  after  the  coin  flip)  =  P("<5"  =  1)  I H)  P(H)  + 
P("<5"  =  1)  I  T)  PTT)  =  0  x  P(H)  +  1  x  PCD  =  PCD  = 
1/2.  Thus  P("<5"  after  coin  flip)  =  5/6  but  P("<5"  is 
certain  after  coin  flip)  =  1/2. 

Something  very  similar  is  going  on  in  the  example  of  E. 
Adams  [1].  For  instance,  one  can  say  above  that  the 
probability  that  "<5"  is  highly  likely  (in  fact  certain!)  is 
1/2  but  the  probability  of  ”<5"  is  5/6,  which  might  not 
be  considered  ’’highly  likely"  and  is  anyway  a  very 
different  probability. 
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